Tree-based recursive partitioning methods for subdividing sibpairs into relatively more homogeneous subgroups.

نویسندگان

  • W D Shannon
  • M A Province
  • D C Rao
چکیده

We propose a new splitting rule for recursively partitioning sibpair data into relatively more homogeneous subgroups. This strategy is designed to identify subgroups of sibpairs such that within-subgroup analyses result in increased power to detect linkage using Haseman-Elston regression. We assume that the subgroups can be defined by patterns of non-genetic binary covariates measured on each sibpair. The data we consider consists of the squared difference of a quantitative trait measurement on each sibpair, estimates of identity-by-descent (IBD) values at each genetic marker, and binary covariate data describing characteristics of the sibpair (e.g., race, sex, family history of disease). To test the efficacy of this method in linkage analysis, we performed two simulation experiments. In the first, we simulated a mixture consisting of 66.6% of the sibpairs with no linkage and 33.3% of the sibpairs with genetic linkage to one marker. The two groups were distinguished by the value of a single binary covariate. We also simulated one unlinked marker and one random covariate to include as noise in the data. In the second experiment, we simulated a mixture consisting of 55% of the sibpairs with no genetic linkage, 22.5% of the sibpairs with genetic linkage to one marker, and 22.5% of the sibpairs with linkage to a different marker. Each subgroup was defined by a distinct pattern of two binary covariates. We also simulated one unlinked marker and two random covariates to include as noise in the data. Our simulation studies found that we can significantly increase the overall power to detect linkage by fitting Haseman-Elston regression models to homogeneous subgroups with only a small increase in the false-positive rate. Second, the splitting rule can correctly identify important covariates and linked markers. Third, recursive partitioning of sibpair data using this splitting rule can correctly identify sibpair subgroups. These results indicate that partitioning sibpairs into homogeneous subgroups is feasible and significantly increases the power to detect linkage, thus demonstrating the practical utility and potential this new methodology holds.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Decision trees in epidemiological research

BACKGROUND In many studies, it is of interest to identify population subgroups that are relatively homogeneous with respect to an outcome. The nature of these subgroups can provide insight into effect mechanisms and suggest targets for tailored interventions. However, identifying relevant subgroups can be challenging with standard statistical methods. MAIN TEXT We review the literature on dec...

متن کامل

ارزیابی متغیرهای پیش‌آگهی در رده‌بندی نرخ بقای بیماران مبتلا به سرطان کولورکتال با استفاده از درخت تصمیم

Background ; Objectives: Identifying the important influential factors is a great challenge in oncology studies. Decision tree is one of methods that could be used to evaluate the prognostic factors and classifying the patients' homogeneously. This method identifies the main prognostic factors and then determines the subgroups of patients based on those prognostic factors. The aim of this...

متن کامل

Aneurysmal subarachnoid hemorrhage prognostic decision-making algorithm using classification and regression tree analysis

BACKGROUND Classification and regression tree analysis involves the creation of a decision tree by recursive partitioning of a dataset into more homogeneous subgroups. Thus far, there is scarce literature on using this technique to create clinical prediction tools for aneurysmal subarachnoid hemorrhage (SAH). METHODS The classification and regression tree analysis technique was applied to the...

متن کامل

Original Contribution Applying Recursive Partitioning to a Prospective Study of Factors Associated with Adherence to Mammography Screening Guidelines

Although a number of predictors of adherence to mammography screening guidelines have been identified using traditional statistical methods, many women are not screening according to these guidelines. Recursive partitioning may aid in developing novel intervention strategies to promote this screening behavior by identifying subgroups of women that differ on adherence across predictor variables....

متن کامل

Application of Survival Tree Model in Determining Affecting Factors in Breastfeeding Duration

Background and Purpose: Survival tree model is a nonparametric method which can be used to identify the affecting factors from a specific time to the onset of an event. In this method, the categories are selected according to the most important factors. The purpose of this study was to determine the factors affecting the duration of breastfeeding in mothers and introduce the homogeneous subgrou...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Genetic epidemiology

دوره 20 3  شماره 

صفحات  -

تاریخ انتشار 2001